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ABSTRACT 

Objective: To summarise the available scientific 
evidence on the health effects of exposure to working 
beyond the limit number of hours established by the 
European Working Time Directive (EWTD) on physicians. 
Design: A systematic literature search was conducted in 
PubMed and EMBASE. Study selection, quality appraisal 
and data extraction were carried out by independent pairs 
of researchers using pre-established criteria. 
Setting: Physicians of any medical, surgical or 
community specialty, working in any possible setting 
(hospitals, primary healthcare, etc), as well as trainees, 
residents, junior house officers or postgraduate 
interns, were included. 

Participants: The total number of participants was 
14 338. 

Primary and secondary outcome measures: 

Health effects classified under the International 
Classification of Diseases (ICD-1 0). 
Results: Over 3000 citations and 110 full articles were 
reviewed. From these, 11 studies of high or 
intermediate quality carried out in North America, 
Europe and Japan met the inclusion criteria. Six 
studies included medical residents, junior doctors or 
house officers and the five others included medical 
specialists or consultants, medical, dental, and general 
practitioners and hospital physicians. Evidence of an 
association was found between percutaneous injuries 
and road traffic accidents with extended long working 
hours (LWH)/days or very LWH/weeks. The evidence 
was insufficient for mood disorders and general health. 
No studies on other health outcomes were identified. 
Conclusions: LWH could increase the risk of 
percutaneous injuries and road traffic accidents, and 
possibly other incidents at work through the same 
pathway. While associations are clear, the existing 
evidence does not allow for an established causal or 
'dose-response' relationship between LWH and 
incidents at work, or for a threshold number of 
extended hours above which there is a significantly 
higher risk and the hours physicians could work and 
remain safe and healthy. Policymakers should consider 
safety issues when working on relaxing EWTD for 
doctors. 



Strengths and limitations of this study 



The first systematic review, to the best of our 
knowledge, on the effects of long working hours 
(LWH) on physicians' health and safety. 
A systematic literature search conducted in 
PubMed and EMBASE with over 3000 citations 
and 110 full articles reviewed 
Eleven studies of high or intermediate quality 
carried out in North America, Europe and Japan 
involving physicians of any medical, surgical or 
community specialty, working in any possible 
setting (hospitals, primary healthcare, etc), as 
well as trainees, residents, junior house officers 
or postgraduate interns, were included. 
The findings of this review confirmed that LWH 
are associated with an increased risk of needle- 
stick injuries and road traffic accidents. Evidence 
was assessed as low or insufficient for mood 
disorders and general health. 
This review uncovered the lack of literature on 
the effect of LWH on the general health of 
doctors, a topic that has important health ramifi 
cations not only for physicians but also for 
patients. 



INTRODUCTION 

The European Working Time Directive 
(EWTD), 1 applicable to all occupations 
across the European Union (EU), requires a 
maximum working week of 48 h and estab- 
lishes rest periods. 2 3 Since it came into force 
in healthcare in 2009, it has been associated 
with concerns about the provision of health 
services including continuity of care, lower 
staffing levels, introduction of shift working, 
a reduction in training time and the 
adequate supervision of junior doctors. 4 5 
The large intercountry variations in physi- 
cians' working hours as well as the manda- 
tory controls over working hours across 
occupations raise further questions as to the 
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basis of selecting and setting these limits and restric- 
tions. 6 There is evidence of variable compliance to the 
EWTD within healthcare across Europe. 4 5 The medical 
profession is also increasingly feminised with potential 
implications for women of childbearing age, which may 
coincide with the time in the profession where long 
working hours (LWH) are more likely to happen (ie, 
during residency). Constraint on public sector finances 
and increasing healthcare demands have stimulated the 
debate on physicians' working hours, 7 with some evi- 
dence showing that the quality of healthcare is positively 
correlated with the number of physicians available to 
deliver that care. 8 

The Union of European Medical Specialists (UEMS) 
was created in 1958 to represent medical specialists in the 
EU, promote a high standard of training and practice at 
European level and facilitate the free movement of physi- 
cians across European countries. 9 The Occupational 
Medicine Section was created in 1997 and is involved in 
issues related to specialist training, professional practice 
and research 10 ; elaborating and publishing reference 
documents, 11 conducting and contributing to surveys 12-14 
and working in partnership with other key European 
groups. 

The trigger for this project was a request by the 
Council of the UEMS to its Occupational Medicine 
Section for a position statement on the EWTD and pos- 
sible health consequences to physicians of a potential 
removal of this professional group from the current 
EWTD 48h/week limit. 15-17 An evidence-based 
approach was taken by the Section, and a systematic 
review of the literature was undertaken. 15 

LWH have been associated with several adverse effects 
on workers' health, such as hypertension, cardiovascular 
disease, stress, depression, musculoskeletal disorders, 
chronic infections, diabetes, general health complaints 
and all-cause mortality in a variety of occupational 
groups. 18-26 In physicians, there is some evidence sug- 
gesting that LWH can result in impacts such as stress, 
depression, burnout, injuries, fatigue and sleep depriv- 
ation, 27-29 and overwork has been considered one of the 
most stressful features of physicians' work. 27 Some evi- 
dence exists of the beneficial effect of reducing the 
number of working hours on the quality of life of 
medical residents 30 and on burnout. 27 

However, the relationship between work and health is 
complex. There is a substantial body of evidence 
showing that worklessness is associated with poorer 
health, whereas work is generally good for health and 
well-being, 31 32 provided that the work environment is 
reasonably acceptable and supportive. 33 

Occupational and non-occupational exposures may 
play an important role such as the pattern and distribu- 
tion of working hours, breaks and recovery periods. 
Also, psychosocial and organisational factors such as 
workload, job control, managers and peer support, train- 
ing opportunities, and individual characteristics and atti- 
tudes may be important. 6 34 



Finally, physicians are a highly qualified, devoted and 
motivated professional group. Despite other existing 
reviews of a more general occupational approach, a 
review focused on physicians is warranted. 

The aim of this project was to systematically review 
whether LWH, defined as more than the 48 h/week limit 
imposed by the EWTD, are associated with health effects, 
classified under the International Classification of Diseases 
(ICD-10), 35 in physicians, and to examine what these asso- 
ciations are. This review does not include the potential 
impact of LWH on patient care or physician training. 

METHODS 

The systematic review sought to respond to the following 
research question: Are LWH associated with health 
effects in physicians? For the purpose of this review, 
exposure to LWH was defined as working for more than 
48 h/week. 1 Health outcomes included any disease as 
defined in the ICD-10 35 and work-related injuries. If self- 
assessed tools on health status were used, only those vali- 
dated were considered. Burnout, stress and outcomes 
such as individual symptoms, signs or biological markers 
(blood pressure, ECG, etc) were not considered. 
Physicians of any medical, surgical or community spe- 
cialty working in any possible setting (hospitals, primary 
healthcare, etc), as well as trainees, residents, junior 
house officers or postgraduate interns, were included. 

Study identification 

Electronic searches were carried out using PubMed and 
EMBASE as search engines (December 2011). Our 
search strategy was similar in both databases and con- 
sisted of a combination of the following keywords and 
MeSH terms: night shift, morning shift, evening shift, 
afternoon shift, shift work, rotating shift, shift combin- 
ation, shift duration or length, shift system, clockwise 
rotation, shift roster, extended shifts, night work, evening 
work, work schedule, work hours, starting time, early 
start, irregular working hours, direction of rotation, over- 
work, extended hours, shift rota, workload, work sched- 
ule tolerance, sleep deprivation, sleep disorders, 
chronobiology disorders, circadian rhythm, psychomotor 
performance, circadian disruption, vigilance, alertness, 
wakefulness, drowsiness, fatigue, insomnia, hypersomno- 
lence, dyssomnia, eveningness, morningness, neurocog- 
nitive performance, concentration difficulties, arousal, 
health, morbidity, mortality, disease, illness, stress, strain, 
distress, accident, injur*, death, suicid*, education, 
medical, physician, medical staff, hospital, doctor, 
surgeon, house officers, medical school, surgery, surgical. 

In addition, the reference lists of articles selected for 
inclusion were carefully reviewed to identify additional 
studies. 

Study selection 

Studies were included if they measured the association 
between the exposure to LWH and health effects in 
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physicians, and used an observational epidemiological 
(ie, cohort, case-control, cross-sectional) or experimen- 
tal design. Articles relating to on-call, night or shift 
work, but with unknown exposure or exposed to less 
than 48 working hours per week were excluded. They 
were also excluded if the working groups were other 
than physicians or addressed other exposures or out- 
comes (ie, patient safety, fatigue, sleep deprivation, 
social or family disruption). No limitation was set for 
languages. 

A total of 2036 citations were obtained from PubMed 
and 980 non-duplicated citations using EMBASE, yield- 
ing a total of 3016 citations that were all screened by inde- 
pendent pairs of reviewers. All titles and, where 
necessary, abstracts were reviewed. Ninety-two potentially 
suitable publications were identified from the electronic 
searches, and 18 further studies were identified from the 
reference lists, yielding a total of 110 studies published in 
English, French, German, Italian, Slovenian and Spanish. 
For all of them, the full text was obtained and reviewed 
by independent pairs of reviewers. Disagreements within 
pairs were resolved by discussion and, where necessary, by 
a third reviewer who made the final decision. All authors 
participated as reviewers for screening the citations and 
full papers using well-defined and pre-established cri- 
teria. Ninety-nine articles were excluded at this stage. 
One further study was identified that was published after 
the search period. Finally, 12 papers were considered for 
quality assessment (figure 1). 

The methodological quality of the 12 studies was 
assessed by independent pairs of reviewers. A standar- 
dised 16-point scale based on CONSORT and STROBE 
statements and adapted from a previous systematic 
review 36 was used. It includes 16 items grouped into six 
areas: (1) objectives; (2) study design; (3) target popula- 
tion and sample; (4) variables; (5) data sources, collec- 
tion and measurement and (6) statistical methods. Each 
item was rated as 1 (the requirement was met), 0.5 



(the requirement was partially met) or 0 (the require- 
ment was not or unclearly met). Disagreements within 
pairs of reviewers were resolved by consensus or, where 
necessary, by a third reviewer. For each of the 12 studies, 
a final score based on the sum of all items was assigned 
and the percentage was calculated based on a maximum 
score of 16. Study quality was rated as low, moderate or 
high if it scored less than 60%, between 60% and 79.9%, 
and 80% or more of the maximum score, respectively. 

One article of low quality was excluded, 37 and a total 
of 11 studies of intermediate or high methodological 
quality were included for the purpose of this review 
(figure 1). 

Data extraction 

Selected information was extracted from each paper, 
including publication year, country of origin, study 
design, setting, study population, sample size, response 
rate, measure of working hours, definition used for 
LWH, health outcomes and their measurement, main 
results on point risk estimates or frequencies of health 
outcomes, their corresponding 95% CI and whether the 
analyses were adjusted for potential confounders. 

Evidence synthesis 

To summarise the results on the relationship between 
LWH and health effects, levels of evidence synthesis was 
performed. This was based on the methodological 
quality, study design and consistency of the study out- 
comes. The following criteria were based on two previ- 
ous relevant systematic reviews: 

Strong evidence: consistent results in more than two 
studies of high quality. 

Moderate evidence: consistent results in one high- 
quality study and one intermediate, or between some 
studies of intermediate quality. 

Insufficient evidence: identification of only one study 
or inconsistent results across studies. 



Figure 1 Results of the search 
strategy, using search engines on 
PubMed and EMBASE 
(December 201 1) and screening 
of references lists of identified full 
papers, study selection and 
quality assessment. 



3,016 citations obtained from search strategies (2,036 PubMed, 980 Embase) 



Study selection: screening of title and, if needed, abstract. 



110 full papers assessed for eligibility 

(92 from electronic searches + 18 from references lists) 



Study selection: inclusion criteria 

1 additional study published in 2012 (January) was identified 



12 studies selected 



Quality assessment. 

Exclusion of one study of low quality. 



11 studies included in the review 
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Evidence of no association: consistent results of a non- 
association in two or more studies. 

A measure of the possible magnitude of the associ- 
ation was attempted using the following criteria based 
on the association point estimate (RR=relative risk, rate 
ratio or OR) when available; otherwise the comparison 
between two frequencies was used: 

► High: RR >3.00; 

► Intermediate: RR=1. 50-3.00; 

► Low: RR=1.01-1.49; 

► No association: RR=1.00; 

► Unclear. 



RESULTS 

We identified 11 studies related to LWH and health 
effects in physicians that were eligible for inclusion. The 
outcome of the methodological quality assessment is 
given in table 1. 

Methodological quality was appraised as high for 3 of 
the 11 studies, with scores ranging from 84% to 94%. 
The other eight studies were considered as intermediate 
quality as their scores ranged from 63% to 78%. The 
majority of all included studies received positive scores 
on items describing the study objectives, design and the 
study population (items from areas 1, 2 and 3), as well 
as the description of exposure and outcome variables 
(item 4). However, although data sources and collection 
were well described, reliability and validity of exposure 
to LWH and health outcome measures had low scores 
(items 5b and 5c). The statistical methods were in 
general appropriate; however, the confounding variables 
were not measured in several studies and, as can be 
seen, the studies did not describe how they addressed 
missing data (items 6a and 6b) . 

Study characteristics 

The characteristics of the included studies are described 
in table 2. 

Five had been carried out in North America (four in 
the USA 40 41 43 47 and one in Canada 50 ), five in 
European countries (Denmark, 49 Germany, 45 Norway 

43 48 42 

and Germany, Sweden and the UK ) and one study 
in Japan. 44 Four used a prospective cohort design 
(follow-up ranging from 6 to 20 months), two were case- 
crossover, four cross-sectional and one was a before and 
after study. Overall, the total number of participants was 
14 338 and included medical residents, junior doctors or 
house officers only working in hospitals, medical specia- 
lists or consultants, medical and dental practitioners, 
general practitioners, hospital physicians and one study 
included female physicians from a community service. 
Most studies (n=9) compared different working hour 
patterns within the same group of physicians, one com- 
pared physicians from two different countries 46 and one 
included other healthcare workers as the comparison 

43 

group. 



LWH was defined as more than 48 h/ week in most 
studies, and some did not provide a definition. Two 
studies considered LWH below 48 h/week 46 48 All the 
included studies investigated health effects associated 
with working more than 48 h/ week, with number of 
hours ranging between <30 and 110 h/week. One paper 
studied the effects of an increasing number of hours 
from 30 to more than 50 h/week. 44 Others studied more 
than 40, 49 47 48 and above 58 h/week 46 (two studies), and 
six studies referred to more than 70 h/ week, 40-43 47 50 
one of which compared 78-80 h/week with 90-110 
weekly hours. 47 Information on working hours was self- 
reported in nine studies, with reasonably good validity 
only in two studies 40 41 and reliability in one. 43 In the 
other two studies, it was obtained from established work 
schedules 47 50 

Identified health outcomes that fulfilled the inclusion 
criteria were percutaneous injuries, motor vehicle acci- 
dents, mood disorders and general health. Information 
was self-reported in all studies using well-known vali- 
dated questionnaires, except three studies that used 
documented information on motor vehicle crashes, 40 
reported percutaneous injuries 42 and antidepressant 
prescription data as a surrogate of depression. 49 

Summary of findings 

Table 3 shows the findings from each of the 1 1 studies. 

Percutaneous injuries: One study of high quality and one 
of intermediate quality, with non-adjusted analyses, 
showed consistent results in medical trainees. Ayas et af° 
found an increased risk of percutaneous injuries asso- 
ciated with working more than 20 h/ day compared with 
non-extended hour periods, except in intensive care 
units. Fisman et at 2 * found an association between self- 
reported fatigue and the number of working hours at 
the time of injury and a threefold increased risk of 
sharp injuries in medical trainees compared with other 
healthcare workers (median of working hours/week 
before the injury: 70 vs 40; p<0.001). 

Motor vehicle accidents: Two studies of high and inter- 
mediate quality showed that long working weeks were 
associated with an at least twofold increased risk. Barger 
et at x found adjusted increased risks for car crashes (OR 
2.3; 95% CI 1.6 to 3.3) and near miss accidents (OR 5.9; 
95% CI 5.4 to 6.3) associated with working extended 
shifts. Kirkcaldy et at b showed that non-adjusted inci- 
dence rates of traffic accidents on house visits was five 
times as much when physicians worked for a mean of 
58 h/week compared with 38 h/week (p<0.05). 

Mood disorders: Six studies of intermediate and high 
quality found contradictory results for mood disorders. 
Three of them, all of intermediate quality, provided evi- 
dence of an association between LWH and mental 
health problems. The prevalence of General Health 
Questionnaire (GHQ)-30 cases was higher for female 
physicians working above 50 h/ week than for those 
working 30 or less hours/week (p<0.05), 44 and an 
increased adjusted risk of impaired mental health was 
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Sweden 


Cross-sectional 


Primary 


General practitioners 


Overtime defined as working at 


Swedish SF-36, 


12.0 (75) 


Johansson 48 






care 


Sample size: 1004 (72%) 


least 47 h/week. Self-reported 


impaired mental 




Varma et at* 9 












health, self-reported 




Denmark 


Cohort, prospective 


Hospital 


Senior medical consultants 


Long work hours (>40 h/week). 


Depression: 


15.0 (94) 






(20 months) 




Sample size: 2790 (62%) 


Self-reported 


redemption of 
















anti-depressive drug 




Zahrai efa/ 50 












prescriptions 




Canada 


Cohort, prospective 


Hospital 


Orthopaedic surgery 


Night float (n=9): 77.8% did 


SF-36, mental health 


1 1 .0 (69) 






(6 months) 




residents 


>80 h/week at baseline; 71 .4% 


score, self-reported 












Sample size: 16 (not 


at follow-up 














specified) 


Standard call (n=7): 57.1% did 
















>80 h/week at baseline; 80% at 
















follow-up 






*Follow-up period in brackets for prospective cohort studies. 












BDI, Beck Depression Inventory II 


GHQ-12, General Health Questionnaire-12 items; HCW, Health Care Workers; SCLDS, Symptom Checklist Depression Scale. 
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rable 3 Results on the association between long working hours and identified health effects in physicians 



Health outcome 



Main results 



Adjustment by confounders 



Study ID 



Percutaneous injuries 




Percutaneous injuries 



Motor vehicle accidents 



Motor vehicle accidents 



Mood disorders, 
depression 




Mood disorders, 
GHQ-30 



Mood disorders, 
GHQ-12and SCLDS 
Mood disorders, SF-36 



OR (95% CI) for injuries during extended (>20 h/day) vs non-extended periods (first 
12 h at work): all percutaneous injuries: 1.61 (1.46 to 1.78); injuries reported to OH: 
1.83 (1.48 to 2.28); injuries in the ICU: 1.87 (0.69 to 5.04); injuries in the operating 
room or labour and delivery: 1.77 (1.49 to 2.09); injuries in the ICU, non-ICU or ED: 
2.17 (1.56 to 3.00) 

Medical trainees vs other healthcare workers: total median working hours: 70 vs 
40 h (p<0.001); median previous working hours at the time of injury: 6.5 vs 5 h 
(p<0.001). 

OR (95% CI) for self-reported fatigue: work >40 h in the past week: 3.50 (2.06 to 
5.92); work during more than 5d in past week: 4.20 (4.46 to 7.15); >12 h at work 
before injury: 8.58 (3.7 to 19.86). 

Incidence rate ratio (95% CI) for association between fatigue and injuries: all 
workers: 1.40 (1.03 to 1.90); medical trainees: 2.94 (1.71 to 5.07); other HCWs: 0.97 
(0.66 to 1 .42; p=0.001 for heterogeneity) 

ORs (95% CI), after extended shift (>24 h) vs non-extended shift (<24 h): 
Crashes=2.3 (1.6 to 3.3); near miss accidents=5.9 (5.4 to 6.3) 
Incidence rates of driving accidents on house visits: 

Long hours (mean 58.36 h/week)=0.10 (0.31) vs short hours (mean 38.17 h/week) 
=0.02 (0.18). p<0.05 

HR (95% CI) hours/week intervals and redemption of antidepressive drug 
prescription(reference group: 37-40 h/week): 

25-36 h: 0.83 (0.24 to 2.82); 41-44 h: 0.95 (0.50 to 1 .77); 45-49 h: 0.88 (0.43 to 
1.78); 50-54 h: 0.83 (0.32 to 2.14); 55-59 h: 0.67 (0.15 to 2.94); >60 h: 0.48 
(0.06 to 3.68). 

Cox regression analysis of work hours as a continuous variable: HR=0.93 
(0.76 to 1.13) 

Prevalence of cases by working time (hours/week) 

<30 h: 35.7% (41/1 15); >30-40 h: 39.0% (39/100); >40-50 h: 37.7% (26/69); >50 h: 
56.8% (46/81); p=0.0179 

Stepwise multiple logistic regression, >50 vs <50 h/week: parameter 0.635 
(p=0.0293) 

No association was found between number of hours worked in a week and 
depression 

Scores when night float vs standard call at baseline and follow-up: 
Mental health mean score (SD): baseline=57.33 (22.63) vs 65.71 (7.61); 
follow-up=52.00 (15.49) vs 60.80 (11.45); p=0.72 

Mental health component summary (SD), baseline=34.84 (14.06) vs 40.21 (7.61); 
follow-up=30.15 (10.71) vs 42.40 (6.23); p=0.39 

Regression analysis: increased number of hours in hospital correlated with 
significantly lower SF-36 scores in almost all domains 



Time of day and circadian influences 



Age and sex 



Ayas et af° 



Fisman et a? 3 



Age and sex 
Not adjusted 



Age, sex, marital status, medical specialty, 
decision authority at work, social support at 
work, quantitative work demands and 
previous redemption of AD drug prescription 



Marital status, medical facility, position and 
night duty 



Not adjusted 
Not adjusted 



Barger et a/ 41 
Kirkcaldy ef a/ 45 



Varma et a? 9 




Hayasaka et af A 

Firth-Cozens 42 
Zahrai et a/ 50 




Continued 



Health outcome 



Main results 



Adjustment by confounders 



Study ID 



Mood disorders, SF-36 
and BDI 



Mood disorders, SF-36 
General health, SF-36 



General health, SF-36 



General health, SF-36 



General health 



Improvement of levels before and after duty hours limitations from 90-1 10 to 78- 
80 h/week): 

Mental health SF-36: no statistically significant improvement 
Individual questions BDI before and after duty hours limitations, only energy level 
statistically significant 

OR (95% CI)for working 47 h/week or more: 

Impaired mental health, men: 1.59 (0.95 to 2.66); women: 1.86 (1.03 to 3.37) 
SF-36: 

Scores when night float vs standard call at baseline and follow-up: 
General health mean score (SD): baseline=62.1 1 (17.47) vs 77.57 (24.25); 
follow-up=56.43 (24.89) vs 84.20 (16.50); p=0.41 

Physical health component summary (SD): baseline=46.16 (13.15) vs 52.01 (13.33); 
follow-up=39.32 (9.80) vs 56.15 (2.18); p=0.015 

Vitality mean score (SD): baseline=51 .67 (14.58) vs 51.43 (15.74); follow-up=48.57 
(14.92) vs 51.00 (10.84); p=0.20 

Regression analysis demonstrated that the increased number of hours spent in 
hospital correlated with significantly lower general health, physical function, mental 
health, role emotional, social function and mental component summary scale scores 
(all p<0.05) 
SF-36, BDI: 

Improvement of levels before and after duty hours limitations from 90-1 10 to 
78-80 h/week: 

Physical health: no statistically significant differences 
SF-36: 

OR (95% CI) adjusted for age and time in practice, for working 47 h/week or more: 
Impaired general health: men=1.66 (1.00 to 2.77); women=1.59 (1.00 to 3.17) 
Self-rated health: 

OR (95% CI) of good self-rated health (logistic regression): 
Norwegian work time pattern*=1 .35 (1 .03 to 1 .77); working in Norway=4.17 
(3.02 to 5.73) 



Not adjusted 



Not adjusted 



Age and amount of time in practice 



Age, sex and country of work 



Stamp et af 7 




Sundquist and 
Johansson 48 

Rosta and 
Aasland 46 



*Not working more than 9 h a day and having more than 60 h a month on-call. 

AD, anti-depressive (Mood disorders, depression); BDI, Beck Depression Inventory II; ED, Emergency Department; GHQ-12, General Health Questionnaire- 12 items; ICU, intensive care unit; 
OH, Occupational Health (Percutaneous Injuries); SCLDS, Symptom Checklist Depression Scale. 
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found for male and female general practitioners who 
worked for 47h/week or more. 48 Zahrai et at found 
that increased number of hours spent by residents in 
hospital correlated significantly with lower mental health 
and Mental Component Summary Scale scores (SF-36). 
Three other studies, one of which is of high quality, did 
not find an association between depression or mood dis- 
orders and LWH. Varma et at 9 did not find higher 
adjusted risks of depression associated with increased 
number of hours in senior medical consultants, even 
when they worked for more than 60 h/ week. Depression 
or mood disorders measured with GHQ-12, Symptom 
Checklist Depression Scale (SCLDS) or Beck Depression 
Inventory (BDI), all validated tools for mental health, 
were not associated with increased number of working 
hours 42 nor improved after decreasing the number of 
hours from 90-110 to 78-80 h/week in residents. 47 

General health: Four studies of intermediate quality ana- 
lysed the association between LWH and general health. 
Zahrai et af° found that increased number of hours 
spent by residents in hospital correlated significantly 
with poorer general health, physical function and vitality 
using SF-36. A comparative study of physicians in two dif- 
ferent countries found that Norwegians showed higher 
non-adjusted prevalence of self-reported good health 
compared with physicians in Germany who worked 
longer hours. 46 An increased adjusted risk of impaired 
general health was found for male and female general 
practitioners who worked for more than 47 h/ week. 48 
However, following the implementation of reducing the 
number of hours from an average of 90-110 to 78-80 h/ 
week did not lead to an overall improvement of resi- 
dents' self-reported physical health. 47 

Synthesis of the evidence 

The levels of evidence synthesis obtained from the ana- 
lysis of the 11 studies included in this review are shown 
in table 4. 



There is moderate evidence of an association between 
LWH and percutaneous and motor vehicle accidents in 
physicians. This evidence comes from one study of high 
quality and another of intermediate quality for percutan- 
eous injuries and for motor vehicle accidents; the mag- 
nitude of this association could be intermediate. 

For mood disorders and general health, there is a low 
or insufficient level of evidence of an association with 
LWH in physicians. This comes from six studies on 
mood disorders and four on general health of high and 
intermediate methodological quality and inconsistent 
results among them. No conclusion can be drawn about 
the magnitude of such associations, even if they existed. 

DISCUSSION 

This review found moderate scientific evidence for a 
positive association of intermediate magnitude between 
exposure to LWH and percutaneous injuries and motor 
vehicle accidents in physicians. Evidence was assessed as 
low or insufficient for mood disorders and general 
health. 

To the best of our knowledge, this is the first system- 
atic review on the effects of LWH on physicians' health 
and safety. The extensive searches were restricted to 
indexed journals. We systematically identified, selected 
and assessed the methodological quality of studies by 
means of independent pairs of reviewers. The quality 
assessment form and the levels of evidence used in this 
review were based on CONSORT and STROBE state- 
ments and on previous systematic reviews. 36 38 39 To give 
some estimate of the potential magnitude of the effect, 
we added levels based on the point estimates of mea- 
sures of association provided by the studies. However, 
such levels had not been previously established. 

Strengths and limitations of the studies 

Four studies had a longitudinal prospective design, 
however only two of them used a robust methodology. 



Table 4 Available scientific evidence on the health problems associated with long working hours in physicians: levels of 


evidence synthesis 








Degree of 


Magnitude of the 


Health outcome 


evidence* 


associationt Studies ID 


Percutaneous 


++ 


++ Ayas et al, 40 Fisman et al 43 


injuries 




++ Barger et al 4A Kirkcaldy et at* 


Motor vehicle 


++ 


accidents 






Mood disorders 


+ 


± Varma etal 49 Hayasaka etal 44 Firth-Cozens, 42 






Zahrai etal, 50 Stamp etal 47 Sundquist and Johansson 48 


General health 


+ 


± Zahrai et al, 50 Stamp et al 47 Sundquist and Johansson 48 






Rosta and Aasland 46 


*Strong evidence (+++): consistent results in more than two studies of high quality; moderate evidence (++): consistent results in two studies 


of high quality, or one high-quality study and one intermediate, or between some studies of intermediate quality; insufficient evidence (+): 


identification of only one study or inconsistent results across studies; evidence of no association (-): consistent results of a non-association in 


two or more studies. 






tMagnitude of the association: it refers to the magnitude of the association point estimate (RR=relative risk, rate ratio or OR): high (+++) if RR 


>3; intermediate (++) if RR=1.5-3; low (+) if RR <1.5; no association (-); unclear (±). 
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Two other studies used a crossover design, which 
reduces the likelihood of individual variability and con- 
founding, and were considered well designed. Four 
studies, all rated as intermediate methodological quality, 
were cross-sectional in design and therefore it is difficult 
to draw causal relationships from them. Although 
reverse causality cannot be ruled out from cross- 
sectional designs, it is unlikely that poorer health deter- 
mines longer working hours than their healthier peers. 
One study used a quasi-experimental design but lacked 
a control group. Also, because of different designs used 
across the included studies, different estimates of the 
effects were used that may not be directly comparable. 

Other limitations include the variability, validity and 
reliability of working hours, which in most studies was 
self-reported. Although only those papers specifying the 
number of working hours were accepted, shift and night 
work might have worked as confounders too, as it is diffi- 
cult to acknowledge which proportion of the health 
effects observed was due exclusively to the LWH compo- 
nent and not due, for instance, to stress, fatigue or sleep 
deprivation. Moreover, the studies did not discriminate 
between time spent actively working and asleep while on 
call, though the EWTD considers all hours on call as 
working time. Likewise, it is possible that participants 
were aware that the study was conducted to assess asso- 
ciations between the number of hours worked and 
health outcomes. Therefore, there might be a volunteer 
bias in those doctors who work extended hours. To min- 
imise this bias, a case-crossover analysis has been per- 
formed in three of the included studies. 40 41 43 

Health outcomes were measured mainly based on self- 
report, with the exception of documented motor vehicle 
crashes, 40 reported percutaneous injuries 42 and register- 
notified antidepressive drug prescriptions. 48 No studies 
with other objective measures of health (eg, mortality) 
or mental disease (eg, hospital data) were identified. 
However, self-reported or perceived health was assessed 
by validated and widely used instruments, especially 
when health status (general, mental or physical health) 
or ill-health symptoms were evaluated. Nonetheless, this 
raises the possibility that the observed associations might 
reflect differences in propensity to report health pro- 
blems when they occur rather than true differences in 
the risk of worse health status. We cannot rule this out, 
but the higher propensity for reporting among those 
who work for longer hours and poorer work schedule 
patterns would have to relate specifically to the report- 
ing of ill health rather than non-ill health. It seems 
unlikely that major differences in propensity to report 
would extend to a more concrete outcome. 

The majority of the studies of this review took into 
account confounding variables in their analyses, such as 
sex and age. However, none of the studies analysed the 
potential effect of modifying factors, such as psycho- 
social aspects at work, including attitudes, motivation, 
job requirements, demands and content, organisational 
climate, social relationships at work, work satisfaction, 



supportive organisation or the relative number of physi- 
cians and other staff available for patient care. Neither 
were important aspects outside of work, such as life 
events or lifestyles, taken into account. The study of 
Japanese female physicians found significant differences 
by marital status with higher GHQ scores for those who 
were married. 44 There are significant differences in the 
number of physicians per capita across different coun- 
tries and this together with the structure and organisa- 
tion of health services are potential confounders too 
that were not addressed in the selected studies across 
countries. 

A relevant outcome from this review would be to 
establish a threshold number of extended hours above 
which there is a significantly higher risk. There is a lack 
of evidence for a dose-response relationship and nor 
does the evidence give any indication for a threshold 
number of hours that physicians could work and remain 
safe and healthy. 

Finally, the heterogeneity of the included studies did 
not allow a more quantitative synthesis, and a level of 
evidence approach was used instead. 

Comparison with other studies 

Our findings of an association between LWH and injur- 
ies and accidents in physicians are consistent with 
research in other occupational groups. Dembe et al 19 
carried out a longitudinal survey in the USA including 
more than 10 000 workers from a variety of occupations 
and settings. A strong dose-response effect (adjusted for 
age, gender, occupation, industry and region) was found 
between LWH and injuries above 40h/week and 8 h/ 
day; jobs with overtime schedules were associated with a 
61% higher injury hazard rate compared with jobs 
without overtime; working at least 12h/day was asso- 
ciated with a 37% increased hazard rate, and working at 
least 60h/week was associated with a 23% increased 
hazard rate. They concluded that LWH might indirectly 
precipitate workplace injuries through a causal process 
by inducing fatigue or stress in workers. In a separate 
study assessing injury risks to healthcare personnel, 
Dembe et af x demonstrated that the risk of injury when 
working overtime or at least 60 h/ week among physi- 
cians and nurses was statistically significant. In our 
review, we found that information about the length of 
shift work varied across studies, some analysing shifts of 
up to 24 h. Work that included such shifts was more 
strongly associated with accidents and injuries. 40 41 

We found insufficient evidence of an association 
between LWH and mood disorders, other diseases and 
general health in physicians, despite some scientific evi- 
dence suggesting that LWH increase morbidity and mor- 
tality in other occupations. 19-26 

Possible explanations for these inconsistencies might 
be the scarcity of high-quality research, and the fact that 
some of the included papers did not fully address the 
effects of confounding factors, and the possibility that 
working as a physician might have a protective effect by 
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itself, for instance, through higher motivation and social 
recognition relative to other occupations. 

Stamp et alf 7 in the only before and after study, did 
not find significant differences for mood disorder or 
general health before and after implementing require- 
ments to reduce working schedules to 80 h/week in resi- 
dents. It is debatable, however, whether any conclusions 
can be drawn from the impact of reducing extremely 
high working hours (90-110 h/week) to very high 
working hours (78-80 h/week). 

Although we searched for and identified literature 
describing associations between LWH and increased 
mortality in doctors, 52 53 none of those studies specified 
the number of hours worked, and therefore were not 
included. 

Recommendations for future research 

Further research, including well-designed prospective 
and retrospective longitudinal studies, is needed in this 
field to establish on scientific grounds what number of 
hours are safe, to patients as well as to physicians. Also, 
studies on long-term health effects should be considered. 

This review was aimed at studying exclusively possible 
detrimental health effects of LWH (more than 48 h/week) 
on physicians. Other systematic reviews should be con- 
ducted addressing issues such as the impact of working 
hours on the well-being and quality of life of doctors, the 
training quality of junior doctors, and the quality and con- 
tinuity of care and their impact on patient safety. Also, 
from a staffing perspective, the economic and social 
burden of sick leave and physicians leaving the profession 
are important issues that were beyond the scope of this 
review and that should be considered in future research. 
Other established criteria, 54 such as that working hours 
should be 'family friendly', promote gender equality, 
enhance productivity or facilitate worker choice and influ- 
ence over working hours, were not addressed. Further 
research is needed in these other areas. 

Conclusions 

The findings of this review have confirmed that LWH 
are associated with an increased risk of needlestick injur- 
ies and road traffic accidents. The fact that these associa- 
tions are labelled moderate in this review should not 
preclude safety procedures and measures being assessed 
and reassessed to ensure maximum physician and 
patient safety. The reduction of needlestick injuries 
requires improved use of safe sharp devices, compliance 
with safe working procedures, training and regular 
audits of working practices, particularly for junior 
doctors who are at greatest risk of needlestick injuries, 
possibly due to inadequate training. 55 56 The risk of 
road traffic accidents could be reduced by discouraging 
driving after very long shifts (more than 16 h) and 
where there is sleep deprivation. 57 Transport should be 
provided by the employer in these situations. This is a 
pragmatic recommendation which recognises that LWH 
have not been eliminated as a result of the EWTD. 



This review uncovered the lack of literature on the 
effect of LWH on the general health of doctors, a topic 
that has important health ramifications not only for phy- 
sicians but for patients as well. However, the paucity of 
available studies, which does not allow for a causal or 
dose-response relationship to be established, raises 
further questions about the evidence base of the current 
48 h limit, especially as this is not enforced strictly and 
junior doctors, for example, in the UK, may opt-out. 58 It 
may be relevant that the EWTD was not based entirely 
on science but also took into account, as does all EU 
legislation, the views of 'social partners'. 3 

It is likely that the mechanism for these increased 
health and safety risks is fatigue, and our finding of 
increased risk of accidents to staff may also be associated 
with increased risk of clinical errors. Patient safety was 
not the purpose of this review but LWH are well recog- 
nised to cause decrement in performance in healthcare 
and other professions, where the performance of staff 
can be safety critical, such as being airline pilots or pro- 
fessional drivers. 59 

In addition, the EWTD has been associated with real 
concerns about the adequacy of the training of doctors 
given their reduced exposure to patient care, and the 
lack of experience of the patient journey, imposed by 
adherence to the EWTD. 4 This is potentially exacer- 
bated at the same time by the constraints on the overall 
duration of training imposed by other EU directives. 60 
Further reviews of the EWTD, with regard to this unique 
group of workers, need to take into account other social 
factors such as the impact on patient care. This is par- 
ticularly the case when all healthcare systems are under 
strain because of burgeoning demands and limited 
resources. 4 

The findings of this systematic review lead to the sug- 
gestion of the following guidelines in the event of non- 
compliance with the EWTD: (1) long shifts should be 
avoided to protect physicians' health and patient safety 
and no shift should be longer than 16 h (and then 
exceptionally) 28 ; (2) physicians should be discouraged 
from driving after long shifts to reduce the risk of motor 
accidents 41 45 ; (3) rigorous attention should be paid to 
reduce the risk of sharp injuries 40 43 ; (4) organisational 
aspects such as workload and job control, as well as the 
pattern and distribution of working hours, breaks and 
recovery periods should be carefully taken into account, 
in order to avoid fatigue and sleep deprivation that 
could lead to mistakes and accidents 61 62 and (5) physi- 
cians should work in supportive psychosocial environ- 
ments, in teams and with adequate training and 
supervision as their well-being is important also for the 
health of their patients. 61 62 

Further research is required: to establish any longer 
term effects, such as on mortality and mental health; to 
assess the impact of the nature of the work organisation 
taking into account the psychosocial aspects of the physi- 
cians' working and non-working lives and to determine 
how many hours are safe for physicians to work. 
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